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Locality and Applications to Subsumption Testing 
in €£L and Some of its Extensions 


Viorica SOFRONIE-STOKKERMANS! 


Abstract 


In this paper we show that subsumption problems in the description 
logics EL and EL* can be expressed as uniform word problems in 
classes of semilattices with monotone operators. We use possibilities of 
efficient local reasoning in such classes of algebras, to obtain uniform 
PTIME decision procedures for TBox and CBox subsumption in €L 
and €£L*. These locality considerations allow us to present a new 
family of (possibly many-sorted) logics which extend €£ and €L* with 
n-ary roles and/or numerical domains. 
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1 Introduction 


Description logics are logics for knowledge representation used in databases 
and ontologies. They provide a logical basis for modeling and reasoning 
about objects, classes (or concepts), and relationships (or links, or roles) 
between them. Recently, tractable description logics such as EL [2] have 
attracted much interest. Although they have relatively restricted expressivity, 
this expressivity is sufficient for formalizing the type of knowledge used in 
widely used ontologies such as the medical ontology SNOMED [26, 27]. 
Several papers were dedicated to studying the properties of EL and of its 
extensions (e.g. EL* [4]), especially to understanding the limits of tractability 
in extensions of €£. Undecidability results in extensions of €£ are obtained 
in [1] using a reduction to the word problem for semi-Thue systems. 
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In this paper we show that the subsumption problem in EL and ELT 
can be expressed as satisfiability problems for ground clauses w.r.t. local and 
stably local (extensions of) theories, for which methods for efficient (PTIME) 
checking of satisfiability of ground clauses exist. The results on local theories 
allow us to extend the tractability results to some extensions of EL and ELT 
with n-ary roles and/or numerical domains. The results were first presented 
in [24]. This paper extends [24] by giving more details on stably local theory 
extensions and providing additional examples. 


Structure of the paper. In Section 2 we present generalities on description 
logic and introduce the description logics EL and EL*. In Section 3 we show 
that CBox subsumption can be expressed as a uniform word problem in the 
class of semilattices with monotone operators satisfying certain composition 
axioms. In Section 4 we present general definitions and results on local and 
stably local equational theories and in Section 5 we show that the algebraic 
models of EL and EL* have local resp. stably local presentations, thus 
providing an alternative proof of the fact that TBox subsumption in €£ and 
CBozx subsumption in €£* are decidable in PTIME. Locality results for 
more general classes of semilattice with operators are used in Section 6 for 
defining tractable extensions of EL and ELT. 


2 Description Logics: Generalities 


The central notions in description logics are concepts and roles. In any 
description logic a set No of concept names and a set Np of roles is assumed 
to be given. Complex concepts are defined starting with the concept names in 
Ne, with the help of a set of concept constructors. The available constructors 
determine the expressive power of a description logic. 

The semantics of description logics is defined in terms of interpretations 
T = (D*,-*), where D7 is a non-empty set, and the function -? maps each 
concept name C' € Nc to a set C? C D* and each role name r € Nr toa 
binary relation r? C D? x D?. 

Table 1 shows the constructor names used in the description logic ALC 
and their semantics. The extension of -7 to concept descriptions is inductively 
defined using the semantics of the constructors. 


Terminology. A terminology (or TBox, for short) is a finite set consisting 
of primitive concept definitions of the form C = D, where C is a concept 
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Table 1: Constructors and their semantics 


Constructor name Syntax | Semantics 

negation a DAC 

conjunction Cyn Co Ce nq ers 

disjunction Cres) CP ue 

existential restriction | 4r.C {x | Sy ((a,y) € r? and y € C7)} 
universal restriction | Vr.C {x | Vy (if (x,y) € r? then y € C7)} 


name and D a concept description; and general concept inclusions (GCI) of 
the form CE D, where C' and D are concept descriptions. 


Interpretations. An interpretation Z is a model of a TBox 7 if it satisfies: 


e all concept definitions in T, i.e. C’=D* for all definitions C=D € T; 


e all general concept inclusions in T, i.e. C1CD* for every CLD € T. 


Since definitions can be expressed as double inclusions, in what follows we 
will only refer to TBoxes consisting of general concept inclusions (GCI) only. 


Definition 1 Let T be a TBoz, and Ci, C2 two concept descriptions. C1 is 
subsumed by C2 w.r.t. T (for short, Cy Ez C2) if and only if C? C Cy for 
every model T of T. 


Relationships between concepts and roles are described using CBoxes. 


Constraint box. A CBox consists of a terminology 7 and a set RI of 
role inclusions of the form r, o--- or, EC s. (Since any terminology can be 
expressed as a set of general concept inclusions, in what follows we will view 
CBoxes as unions GCI U RI of a set GCI of general concept inclusions and 
a set RI of role inclusions of the form 71 0--- orp CE s.) 


Interpretation. An interpretation Z is a model of the CBox C = GCIURI 
if it is a model of GCI and satisfies all role inclusions in C, i.e. r?o---orZ C s* 
for all ry o---or, CsE RI. 


Definition 2 If C is a CBoz, and C,,C2 are concept descriptions then 
C; Ee Co if and only if GF e Ce for every model f of C. 
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2.1 The Description Logics €£ and EL* 


By restricting the type of allowed concept constructors, less expressive but 
tractable description logics can be defined. If we only allow intersection 
and existential restriction as concept constructors, we obtain the description 
logic EL [2], a logic used in terminological reasoning in medicine [26, 27]. In 
[4], the extension ELt of EL with role inclusion axioms is studied. It was 
shown that subsumption w.r.t. CBoxes in E£* can be reduced in linear time 
to subsumption w.r.t. normalized CBoxes, in which all GCIs have one of the 
forms: 


CLD, Clar.D, 


CynNcolD, Ar.ClyD, 


where C, C1, C2, D are concept names, and all role inclusions are of the form 


ros or ryorlr. 


Therefore, in what follows, we consider w.l.o.g. that CBoxes only contain 
role inclusions of the form r Cs and rj org Er. 


3 Algebraic Semantics for EL and €L* 


We show that CBox subsumption for €£ and EL* can be expressed as a 
uniform word problem for classes of semilattices with monotone operators. 


3.1 Algebra: Preliminaries 


We assume known notions such as partially-ordered set and order filter /ideal 
in a partially-ordered set. For further information cf. [8]. A structure (L, A) 
consisting of a non-empty set L together with a binary operation / is called 
semilattice if A is associative, commutative and idempotent. A structure 
(L,V, A) consisting of a non-empty set L together with two binary operations 
V and A on LF is called lattice if V and A are associative, commutative 
and idempotent and satisfy the absorption laws. A distributive lattice is a 
lattice that satisfies either of the distributive laws (D,) or (Dv), which are 
equivalent in a lattice. 


(Dy) Va,y,2 w@A(yVz)—=(@Ay)V (are) 
(Dy) Vie 2VGAs)=—@V yA =): 
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A lattice having both a first and a last element is called bounded. A Boolean 
algebra is a structure (B,V,/A,7,0,1), such that (B,V,A,0,1) is a bounded 
distributive lattice and — is a unary operation that satisfies: 


(Complement) Ve Sar Veo ae n= 0 


Let V be a class of algebras. The universal Horn theory of V is the collection 
of those closed formulae valid in V which are of the form 

Var... V2n( \ Si] = 832 9 ty = ta) (1) 

i=1 

The formula (1) above is valid in V if for each algebra A € V and each 
assignment v of values in A to the variables, if v(s;1) = v(si2) for all 
i€{1,...,n} then v(t,) = v(tz).? The problem of deciding the validity of 
universal Horn sentences in a class V of algebras is also called the uniform 
word problem for Y. It is known that the uniform word problem is decidable 
for the classes: SL of semilattices (in PTIME), DL of distributive lattices 
(co-NP-complete), and Bool of Boolean algebras (co-NP-complete). 


3.2. An Algebraic Semantics for Description Logics 


A translation of concept descriptions into terms in a signature naturally 
associated with the set of constructors can be defined as follows. For every 
role name r, we introduce unary function symbols, f3, and fy,. The renaming 
is inductively defined by: 


e C=C for every concept name C; 


e C=C; CynCg=CyAC2g, CyUuC2,=C1V Co; 


° 70 = f:,(©), WO=f,(O). 


Set theoretical semantics. There exists a one-to-one correspondence 
between interpretations of description logics, Z = (D,-7) and Boolean 
algebras of sets (P(D),U,N, 7,0, D, { far, for }renp), together with valuations 
v: No — P(D), where fa;, fyr are defined, for every U C D, by: 


fa(U) = {x| Ay ((2,y) € 17 andy €U)} 
for(U) = {a | Vy (if (z,y) € r~ then y € U)}. 


?If A is an algebra and v : X > A an assignment, then v extends in a canonical way to 
a homomorphism ¥ from the algebra of terms with variables X to A. For every term t 
with variables in X we will, for the sake of simplicity, write u(t) instead of U(t). 
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Let v : No + P(D) with v(A) = A? for all A € No, and let & be the (unique) 
homomorphic extension of v to terms. Let C be a concept description and 
C be its associated term. Then C? = 0(C) (denoted by C’). 


Boolean algebras with operators. Let BAOj,, be the class of all Boolean 
algebras with operators (B,V,/A,7,0,1, { far, fur }renp), Where 


e fz, is a join hemimorphism, ie. fa-(~Vy) = far(x)V far(y), far(0) = 0; 
e fv, is a meet hemimorphism, i.e. fyr(2 A y)=fer(x) A for(y), Arr(1)=1; 
e fyr(z) = 7f3,(72) for every x € B. 


It is known that the TBox subsumption problem for ALC can be expressed 
as uniform word problem for Boolean algebras with suitable operators. 


Theorem 1 If 7 is an ALC TBox consisting of general concept inclusions 
between concept terms formed from concept names Nc = {Cj,...,Cn}, and 
D,, Dz are concept descriptions, the following are equivalent: 


(1) DiC, De. 


(2) P(D) KVC\...Cp \\ CSD] > DisD2 
CLEDET 
for all interpretations I = (D,-*), 
where P(D) = (P(D),U,N, 7,9, D, { far, fir}renp): 


¥, 


(3) BAOn, EVC1..-Cn \\ CSD] > Di<Dz 
CLODET 


Proof: The equivalence of (1) and (2) follows from the definition of Dj Ey Do. 
(3) = (2) is immediate. (2) = (3) follows from the fact that every algebra 
in BAOy, homomorphically embeds into a Boolean algebra of sets. 


3.3. An Algebraic Semantic for €L* 


In [19] we studied the link between TBox subsumption in €£ and uniform 
word problems in the corresponding classes of semilattices with monotone 
functions. We now show that these results naturally extend to the description 
logic E£L*. Consider the following classes of algebras: 
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e BAO}, the class of boolean algebras with operators 
(B,V,A,7,0,1, {fartreng), Such that f3, is a join hemimorphism; 


e DLO*X, the class of bounded distributive lattices with operators 
(L,V,A,0,1,{far}reng), such that f3, is a join hemimorphism; 


e SLOK, the class of all A-semilattices with operators 
(S,A,0,1,{far}renp), such that far is monotone and f3r(0) = 0. 3 


Assume given a set RI of axioms of the form r EC s and rj org Lr, with 
r1,72,T,8 © Nr. We associate with RI the following set of axioms: 


RIq = {Va aa Gar) < far(&) | T1 OT? i re RI} U 
{Va faa) = file) |e se RI} 


Let BAO, (RI) (resp.DLOX,, (RI), SLOW, (RJ) be the subclass of BAON,, 
(resp. DLO3,,. SLOX,) consisting of those algebras which satisfy RI,. 


Lemma 1 Let Z = (D,-7) be a model of an EL* CBox, C = GCIU RI. 
Then (P(D),N,{far}renp) € SLOW, (RI). 


Proof: Clearly, (P(D),, {far}renpz) € SLOX,- Let 71,72, rE Np and UEP(D). 


far(U) = {a|dyeU st. (2,y) ert} C fa(U) ifr; Cre RI 
far, (fara(U)) = {@ | 3y st. (wy) € rq and y € far(U)} 
= {x | Jys.t. (2, y) ert and 4z € U with (y,z) € 13} 
= {x|4zeU st. (x,z) € (ri0r2)7} 
@ . fae(U) ifrjora2l re RI. 


Theorem 2 Every S € SLO}, (RI) embeds into (the bounded semilattice 
reduct of) a lattice in DLON, (RI). Every lattice in DLON, (RI) embeds into 
(the bounded lattice reduct of) an algebra in BAOX,, (RJ). 


Proof: Let S = (S,A,0,1,{fs}fex) be a semilattice with 0, 1, and with 
monotone operators in © such that fs(0) = 0. Consider the lattice of all 


3For the sake of simplicity, in what follows we assume that the description logics EL 
and EL contain the additional constructors L, T, which will be interpreted as 0 and 1. 
Similar considerations can be used to show that the algebraic semantics for variants of EL 
and E£* having only T (or L) is given by semilattices with 1 (resp. 0). 
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order-ideals of S$, OI(S) = (OI(S),A,U, {0}, S, {fs} fen), where join is set 
union, meet is set intersection, and the additional operators in ¥ are defined, 
for every order ideal U of S, by fg(U) =\fs(U). Note that 


_— F({0}) {0} 
F(UiV U2) = Lf(U1 V U2) =1(f(U1) U f(U2)) = Lf(O1) UL F(U2). 


Thus, OJ(S) € DLOx,,- 4 Moreover, 7 : S + OI(S) defined by n(x) := 
an injective fiompmorphiek w.r.t. the operations in SLOn,, ie. n(f5(x 
Lfs(x) = fg({x). Let r1 org Cr € RI, and let U € OI(S). Then: 


far(U) = +fr(U), 
far, (far, (U)) far, (Fae (U)) — far, Gen (U)). 


The second statement is a consequence of Priestley duality for distributive 
lattices. Let L € DLO}, (RI). Let F, be the set of prime filters of L, and 
B(L) = (P(Fp),U,9, {far}ren,), where for r € R, f3, is defined by 


{x is 
y= 


fa-(U) = {F € Fp |AG EU: fa-(G) C FH. 


Let i: L + B(L) be defined by i(x) = {F € F, | « € F}. Obviously, 7 is a 
lattice homomorphism. We show that i(f3,(2)) = f3,(i(2)). 


fap (ix) 


{F €F,|3aG:x2€G and fa,(G) C F} 
ie € Fp | far (2) € Ey = AGE Cale 


IM 


To prove the converse inclusion, let F € i(f3,(x)). Then F € F, and 
far(z) € F. Thenx €G= 5, (FP). As F is a prime filter, and f3, is a join 
hemimorphism, G = iP) is a prime filter with « € G and f3,(G) C F, 
so Fé f3,(i(x)). 

Finally, we show that B(L) satisfies the axioms in RI,. Let U € B(L). 
Then: 


fa,U) = {Fe F,|4G1 €U: fa.(Gi) C FY, 
far(far,U)) = {F € Fp|5G1,4G, €U: fa,.(Ge) C Gi and fa,,(Gi) C F} 
C {F € Fp | 4G, cU: far, (far. (Ga)) C Bt. 


4A similar construction can be made starting from A-semilattice with monotone opera- 
tors which have only 1 (resp. 0) or neither 0 nor 1. 
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Assume that r; C r € RI. Then for all x, fa,,(4) < fa-(z). Let F € 
fap,(U). Then far,(Gi) C F for some G; € U, so also f3,(G1) C F. Hence, 
far,(U) © fs,-(U). Similarly we can prove that ifr) org Er € RI then 
Far: (far,(U)) Cc fa,(U). 


Theorem 3 If the only concept constructors are intersection and existential 
restriction, then for all concept descriptions D,,D2 and every EL* CBox 
C=GCIURI, with concept names No = {C\,...,Cy} the following are 
equivalent: 


(2) SLOH,, (RI) FE VC1...Cn \\ CSD] > Di<Dz 
CLDEGCI 


Proof: We know that CiLcCo iff CF EGy for every model Z of the CBox C. 

We first prove that (1) > (2). Assume that (2) holds. Let Z = (D,-7) 
be an interpretation that satisfies C. Then (P(D),M,0,D,{far}renp) € 
SLO}, (RI), hence (P(D),1,0,D, {far}reve) F (Accpecer $ D) > 
Di < Dy. As Z is a model of GCI, C’ C D” for all CC D € GCI, so 
DE =Dy CDy = Di. 

To prove (1) = (2) note that, by Theorem 1, if Dj C7 Dez then 
BAOn, EK (AccoecC $D) + Di < Dy. Let $ € SLOX, (RI). By 
Lemma 2, S embeds into an algebra in BAON, which satisfies RIg. Therefore, 
SE (AccnecorC < D) +O < G. 


We will show that the word problem for the class of algebras SLOW, (RI ) 
is decidable in PTIME. For this we will prove that SLOW, (RI ) has a “local” 
presentation. The general locality definitions, as well as methods for rec- 
ognizing local presentations are given in Section 4. We show how to apply 
these methods for the class of algebraic models of EL and EL* in Section 5. 


4 Local Theories; Local Theory Extensions 


First-order theories are sets of formulae (closed under logical consequence), 
typically the set of all consequences of a set of axioms. Alternatively, we 
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may consider the set of all models of a theory. In this paper we consider 
theories specified by their sets of axioms. (At places, however, we will refer 
to a theory, and mean the set of all its models.) Before defining the notion of 
local theory and local theory extension we will introduce some preliminary 
notions on partial models of a theory. 


4.1 Partial and Total Models 


A partial model is a model in which some function symbols may be partial. In 
this paper the models we consider are partially ordered algebraic structures, 
i.e. the only predicates are < and =. 


Definition 3 A weak Il-embedding between the partial structures A = 

({Astses, (fahren, {Pa} perrea) and B = ({Bs}ses, {fa} pes, {PB} Pecrred) 

is a (many-sorted) family i = (is)ses of total maps i, : As > Bs such that 

e if fa(ai,...,@n) is defined then also fp(is,(a1),---,%s,(Gn)) is defined 
and is(fa(ar, tee Gn)) = fB(is, (a1), sets , is, (@n)); 

e for each s, is is injective and an embedding w.r.t. Pred i.e. for every 


P € Pred with arity s1...S, and every a1,...,Qn where a; € As,, 
Pal@i322:5;0,) of and only af Ps(ts,(61), +. +; 4a, (Gn))- 


In this case we say that A weakly embeds into B. 


Definition 4 Let A be a partial structure and 8: X > A be a valuation. 


e We say that (A, 8) & ty = te iff at least one of the following conditions 
holds: 


(a) B(t1), B(t2) are defined and B(t1) = B(t2), or 
(b) B(ti) and B(t2) are undefined, or 
(c) B(ti) is defined, ta=f(s1,...,5n) and B(s;) is undefined for some 
1, OT 
(d) if B(t1) ts defined, tg = f(s1,...,8n) and B(s;) ts defined for all 
i then B(t2) has to be defined and 3(t1) = B(t2). 
e (A,B) & ti < te ts defined similarly, replacing “=” with “<” in 
(a)-(d). 
e We say that (A,B) — ty F te if at least one of the following conditions 
holds: 
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(a’) B(ti), B(t2) are defined and B(t1) # B(t2), or 
(b’) B(t1) or B(t2) are undefined. 


e (A, 8) satisfies a clause C (notation: (A,B) E C) af it satisfies at least 
one literal in C’. 


Definition 5 A is an (Evans) partial model of a set of clauses K if (A,B) 
C for every valuation 8 and every clause C in K. 


Definition 6 Let A be a partial structure and 6: X > A be a valuation. 


e We say that (A,B) Fw (7)P(t1,..-,tn) (A, 8) weakly satisfies (4) P(t1,...,tn)), 
with P € PredU{=} if either B(t;) are all defined and (=) P4(G(t1),.--, B(tn)) 
is true in A, or B(t;i) is not defined for some argument t; of P. 


e (A, () weakly satisfies a clause C (notation: (A, 8) Ew C) if it weakly 
satisfies satisfies at least one literal in C. 


Definition 7 We say that A is a weak partial model of a set of clauses K 
if (A,B) Ew C for every 6: X > A andC eK. 


4.2 Local Theories 


The notion of local theory was introduced by Givan and McAllester [11, 12]. 
They studied sets of Horn clauses K with the property that, for any ground 
Horn clause C, K — C only if already K[C] — C' (where K[C] is the set of 
instances of K in which all terms are subterms of ground terms in either 
K or C). Since the size of K[C] is polynomial in the size of C' for a fixed 
K and satisfiability of sets of ground Horn clauses can be checked in linear 
time [9], it follows that for local theories, validity of ground Horn clauses 
can be checked in polynomial time. Givan and McAllester proved that every 
problem which is decidable in PTIME can be encoded as an entailment 
problem of ground clauses w.r.t. a local theory [12]. The property above can 
be easily generalized to the notion of locality of a set of (Horn) clauses: 


Definition 8 A local theory is a set of Horn clauses K such that, for any 
set G of ground Horn clauses, KUG EL if and only if already K|GJUG EL, 
where K|G] is the set of instances of K in which all terms are subterms of 
ground terms in either K or G. 
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The same considerations as above can be used to show that in any local theory 
satisfiability of sets of ground Horn clauses can be checked in polynomial time. 
In [10], Ganzinger established a link between proof theoretic and semantic 
concepts for polynomial time decidability of uniform word problems which 
had already been studied in algebra [18, 7]. In the course of this work he 
introduced and studied, besides locality, also the less restrictive notion of 
stable locality for equational Horn theories. 


Definition 9 A set K of Horn clauses is stably local if for every set G of 
ground clauses, if KUG EL then G can be refuted using the set KIC] of all 
instances of K obtained by instantiating the variables with (ground) subterms 
of G, i.e. if 

KUGEL #@f and only if KGluG aa 


The more general notion of -stably local theory (in which the instances to 
be considered are described by a closure operation WV) is introduced in [13]. 
Let K be a set of clauses. Let Vx be a function associating with any set T 
of ground terms a set Vx(T) of ground terms such that 


(i) all ground subterms in K and T are in Ux (T); 
(ii 


) for all sets of ground terms T,T”’ if T C T’ then Ux(T) C Ux (T"); 
(iii) for all sets of ground terms T, U«(Ux(T)) C Ux (T); 
) 


(iv) W is compatible with any map h between constants, i.e. for any map 
h:C—>C, Uc(h(T)) = h(Vx(T)), where h is the unique extension of 
h to terms. 


Let K!%«(@)] be the set of instances of K where the variables are instantiated 

with terms in Ux (st(K,G)) (set denoted in what follows by Vxc(G)), where 

st(KC,G) is the set of all ground terms occurring in K or G. We say that K 

is W-stably local if it satisfies: 

(SLoc”) for every finite set G of ground clauses, KUGK L iff KIYx(@lUG 
has no partial model in which all terms in Vx (G) are defined. 


In the particular case when Vx(G) = st(K, G) we obtain again the notion of 
stable locality (the corresponding condition is denoted SLoc). 


Theorem 4 (Complexity) If a set K of Horn clauses satisfies (SLoc”) 
then satisfiability of any set G of Horn clauses w.r.t. K is decidable in 
polynomial time in the size of Vx(G). 
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Proof: This follows from the fact that KI¥<(@] UG is a set of ground Horn 
clauses of size polynomial in the size of Ux(G), and satisfiability of sets 
of ground Horn clauses (in a relational encoding, taking into account only 
suitable instances of the congruence axioms — which are also Horn and not 
more than |W<(G)|?) can be checked in linear time ((9], see also [10]). 


4.2.1 Recognizing Stably Local Theories 


Locality can be recognized by proving embeddability of partial into total 
models [20, 25, 13]. Theories satisfying (SLoc”) can be recognized by showing 
that Evans partial models of 7; embed into total models. 


Theorem 5 Let K be a set of clauses. Assume Vx satisfies conditions (i)— 
(iv) above, and that every Evans partial model of K with the property that 
the set of defined terms is closed under Vx weakly embeds into a total model 
of K. Then K. satisfies SLoc™. 


Proof: Let G be a set of ground clauses. We show that, under the given 
assumptions, if CUG 1 then K/¥x(@] UG has no partial algebra model in 
which all (ground) terms in Ux (G) are defined. Assume that K/Y’«(@] UG 
has a partial Evans model P in which all (ground) terms occurring in Vx (G) 
are defined. We construct a partial model A of K UG as follows. Let 
A= {tp | t © Vxc(G)}. As we want A to be a model of KUG in Evans’ sense, 
we need to make sure that if f is an n-ary function and oo ...,t8 € A and 
(f(#',...,t"))p is defined and equal to, say, tp € A, then fa(th,...,¢) is 
defined in A and equal to tp. Thus, we impose that f4(th,...,t%) is defined 
and yields tp as a result iff tp = f(t!,... ,t”) p € A. We show that the set 
of defined terms in A is closed under Vx. Note first that, by definition of A, 
for any ground term t, t, is defined if and only if there exists t'/ € Ux(G) 
such that t4 = t!,. Thus, 


Def(A) = {t | t ground term ,t,4 defined} = h(Vx(G)), 


where h is the unique homomorphism which extends the map h with h(c) = 
cp for every constant c occurring in Ux(G). Then: 


Wx(Def(A)) = Vic(R(Wx(G))) = BV (G))) € h(W(G)) = Def(A). 


By condition (i), all ground literals occurring in G are defined in P and 
(by construction) also in A. Therefore, A satisfies a ground literal L which 
occurs in G iff P satisfies L. Hence, A satisfies all clauses in G. 
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It remains to show that A satisfies K. Let DEK, and 8: X > A. For 
every x € X there exists at least one t € Vx(G) with G(x) = tp. Thus, there 
exists at least one substitution o : X + Vx(G) such that h(o(t)) = B(t) for 
all terms t, where fA is the canonical projection which associates with every 
term t its interpretation tp in P. Then o(D) is an instance of D in KM«(@], 
We know that P is a model of KI*(@], hence (P,h) K o(D). Therefore 
(A, 8) FD. 

Thus, A satisfies K UG. Therefore, A weakly embeds into a total model 
B of K. It is easy to see that B satisfies the same ground literals as A, so B 
satisfies all clauses in G. Thus, B is a model of KUG, so KUG FL. 


4.3. Local Theory Extensions 


We will also consider extensions of theories, in which the signature is extended 
by new function symbols (i.e. we assume that the set of predicate symbols 
remains unchanged in the extension). 

Let 7p be an arbitrary theory with signature Hp = (So, Xo, Pred), where 
So is a set of sorts, Uo a set of function symbols, and Pred a set of predicate 
symbols. 

We consider extensions 7; of Jo with signature II = (S,%, Pred), where 
the set of sorts is S = Sg US} and the set of function symbols is © = MgU X41 
(i.e. the signature is extended by new sorts and function symbols). We assume 
that 71 is obtained from 7p by adding a set K of (universally quantified) 
clauses in the signature II. Thus, Mod(7;) consists of all II-structures which 
are models of K and whose reduct to IIg is a model of Jo. 


Terminology. Everywhere in what follows, when referring to weak (resp. 
Evans) partial models of Tg UK, To UK[G] resp. To UK'@] we mean weak 
(resp. Evans) partial models of whose reduct to Ig is a total model of To. 


4.3.1 Locality and Stable Locality of an Extension 


Let K be a set of clauses in the signature II. In what follows, when we 
refer to sets G of ground clauses we assume that they are in the signature 
II = (S$, U Xe, Pred), where X, is a set of new constants. 


If T is a set of ground NyoUh;U™%,-terms, where ¥, is a set of (new) constants, 
we denote by Kr the set of all instances of K in which all terms starting 
with a 4 -function symbol are ground terms in T. We denote by K7 the set 
of all instances of K in which all variables occurring below a %1-function 
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symbol are instantiated with ground terms in the set Ty,(Z’) of No-terms 
generated by 7. 


If G is a set of ground clauses and T = st(K, G) is the set of ground subterms 
occurring in either K or G then we write K[G] := Kr, and KIGl := KT, 


We will focus on the following type of locality of a theory extension 7p C 71, 
where Jj = Jo UK with K a set of (universally quantified) clauses: 


(Loc) For every finite set G of ground clauses 7}UG — iff ToUK|[G]UG 
has no weak partial model with all terms in st(K,G) defined. 


(SLoc) For every set G of ground clauses 7] UG KL iff To UK UG 
has no (Evans) partial model with all terms in st(K, G) defined. 


We say that an extension 79 C 7; is local if it satisfies condition (Loc). (Note 
that a local equational theory [10] is a local extension of the pure theory of 
equality (with no function symbols).) The extension 7p C J; is stably local 
if it satisfies condition (SLoc). 

Notions of W-locality can be defined as in the case of local theories [13]. 
In W-local theories and theory extensions hierarchical reasoning is possible. 
We present the ideas for the case of local and stably local theory extensions. 


4.3.2 Hierarchical Reasoning 


Consider a local theory extension 79 C Jo UK. The locality conditions 
defined above require that, for every set G of ground clauses, 7g UK UG 
is satisfiable if and only if Jo UK[G] UG has a weak partial model whose 
IIg-reduct is a total model of Jo and in which all the terms in st(K,G) are 
defined. 

All clauses in K[G] UG have the property that the function symbols 
in 4, have as arguments only ground terms. Therefore, K[G] UG can be 
flattened and purified (i.e. the function symbols in ©; are separated from the 
other symbols) by introducing, in a bottom-up manner, new constants c; for 
subterms t = f(g1,---,9n) with f € 41, g; ground No U X,-terms (where ¥, 
is a set of constants which contains the constants introduced by flattening, 
resp. purification), together with corresponding definitions c; = t. 

The set of clauses thus obtained has the form Kp U Go U D, where D 
is a set of ground unit clauses of the form f(g1,...,gn) = c¢, where f € 4, 
c is a constant, g1,...,gn are ground terms without function symbols in 
X1, and Kp and Go are clauses without function symbols in /1. Flattening 
and purification preserve both satisfiability and unsatisfiability w.r.t. total 
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algebras, and also w.r.t. partial algebras in which all ground subterms which 
are flattened are defined [20]. 


For the sake of simplicity in what follows we will always flatten and then 
purify K[G] UG. Thus we ensure that D consists of ground unit clauses of 
the form f(c1,...,¢n) =c, where f € ¥4, and cj,...,¢n,c are constants. A 
similar transformation can be performed starting with KI@l UG. 


Lemma 2 ([20]) Let K be a set of clauses. Assume that To C To UK is a 
local (or stably local) theory extension. For any set G of ground clauses, let 
Ko UGo U D be obtained from K[G|] UG by flattening and purification, as 
explained above. Then the following are equivalent: 


(1) ToUK * [G]UG has a partial model with all terms in st(K,G) defined. 
(2) ToUKpUGoUD has a partial model with all terms in st(K,G) defined. 


(3) To UKgp UGo U No has a (total) model, where 


No={\ a=& a c=4| fla,...,en) =, f(di,...,dn) =d € D}. 
i=1 


Here, K * [G] is K|G] if To C To UK its a local theory extension and K x |G] 
is KIG] if T C ToUK is a stably local theory extension. 


A similar hierarchical reduction can be performed also for W-local (or stably 
local) theory extensions. 


Theorem 6 ({20]) Assume that the theory extension To C Ti satisfies 
either (1) condition (Loc) or (2) (SLoc) and Jo is locally finite. 

If all variables in the clauses in K occur below some function symbol 
from 1 and if testing satisfiability of ground clauses in To is decidable, then 
testing satisfiability of ground clauses in Ty is decidable. 


4.3.3. Recognizing Local Theory Extensions 


The locality or stable locality of an extension can be recognized by proving 
embeddability of partial into total models [20, 25, 13]. We will use the 
following notation: 
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PModf4(=1,71) is the class of all weak partial models of J, in which the 
»1,-functions are partial and have a finite domain of 
definition and all function symbols in No are total. 

PMod‘4(1, 71) is the class of all (Evans) partial models of J; in which the 
»1,-functions are partial and have a finite domain of 
definition and all function symbols in No are total. 


For theory extensions Jo C 7; = Jo UK, where K is a set of clauses, we 
consider the following conditions: 


(Emb‘?) Every A € PMod'(5), 71) weakly embeds into a total model of 7J;. 
(Emb) Every A € PMod‘(x,, 7) weakly embeds into a total model of 7;. 


In what follows we say that a non-ground clause is /-flat if function 
symbols (including constants) do not occur as arguments of function symbols 
in },. A ¥j-flat non-ground clause is called %j-linear if whenever a variable 
occurs in two terms in the clause which start with function symbols in %4, 
the two terms are identical, and if no term which starts with a function 
symbol in © contains two occurrences of the same variable. 


Theorem 7 ({20, 25]) Let K be a set of U-flat and S-linear clauses. If 
the extension To C T, satisfies (Emb*) then it satisfies (Loc). 


A similar criterion can be given for stably-local theory extensions by com- 
bining ideas in the proofs of similar results in [20, 25]. 


Theorem 8 Let Jo be a universal theory and K be a set of clauses. If the 
extension To C Ti satisfies (Emb") then it satisfies (SLoc). 


Similar results hold also for W-locality of an extension (cf. e.g. [13, 14]). 


5 Locality and Complexity of EL* and EL 


We now show that the classes of algebraic models of ELT and of EL have 
presentations which satisfy certain locality properties. This gives an alter- 
native, algebraic explanation of the fact that CBox subsumption in these 
logics is decidable in PTIME and makes generalizations possible. 


268 V. Sofronie-Stokkermans 


5.1 Locality and €L* 


In this section we prove that the class SLOy(RJ) of semilattices with mono- 
tone operators in a set % satisfying a family RI, of axioms of the form 


Va Fi lesel galt) tex) = Fe) 


has a local presentation, and therefore the uniform word problem w.r.t. 
this class can be decided in polynomial time. For the sake of simplicity we 
restrict, w.l.o.g., to axioms as above with n € {1,2}. 

It is known that the theories of lattices and semilattices allow local 
Horn axiomatizations (cf. e.g. [18, 7]). Let SZ be such an axiomatization for 
the theory of semilattices. We denote by Mon(X) the set {Mon(f) | f € U}, 
where 


Mon(f) Va,y(27<y— f(x) < f(y)). 


Theorem 9 The set of Horn clauses SLU Mon() U RI, has the property 
that every Evans partial model A with the properties: 
(i) for every f ©, fa is a partial function with finite definiton domain; 
(ii) for each axiom in RI, of the form fi(fo(x)) < f(x), and everyae A, 
if f(a) is defined then f2(a) is defined in A; 
(iti) AE SLU Mon()U RI; 
weakly embeds into a total model of SLU Mon(X) U R1g. 
Proof: Let A be an Evans partial model of SZUMon(")U RI, with properties 
(i)—-(iii). In particular, A is a poset, hence it embeds into a complete 


(semi)lattice S such that the meets that exist in A are preserved. (We will 
think of A as a subset of S$.) For every f € © we define f : 5 > S by 


f(a) = MFO |a<c,c€A, fa(c) is defined}. 
For every f € ¥, f is monotone (see e.g. also [25]). We show that the axioms 


in RI, are satisfied by these extensions. 


e Let fi(z) < fo(z) € RI, andaeéS. Then: f;(a) = A{fi(c) | a < 
c,c € A, fi(c) is defined}. Let d € A with a < d and f2(d) defined. 
Then f;(d) is also defined and f;(d) < fo(d). Thus, f,(a) < fo(d) for 
alld € A with a < d and fo(d) defined, so f,(a) < f(a). 


°From Definitions 4 and 5 it follows that then also f,(f2(a)) is defined. 
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e Let now fi(fo(x)) < f(x) € RI, anda e€ S$. Then f.(a) = A{fo(c) | 
a<c,cé€ A, fo(c) is defined}. Then for every a < c, if fo(c) is defined 


then f(a) < fo(c). We prove that f;(f2(a)) < f(a). 

Note first that if a < ¢ and fi(f2(c)) is defined then f(a) < fo(c). 
Therefore, f1(fo(c)) € {fi(c1) | fo(a) < ci, and fi(c1) defined}. Hence, 
{fi(fa(c)) | a < ¢, fi(fa(e)) defined} C { filer) | fo(a) < 1, filer) defined}. 


Therefore, the infimum of the first set is larger than the infimum of 
the second set. Hence: 


fi(fo(a)) 


ACW EAC RAC co 
A{filfe(o)) |a<e and fi(fo(c)) defined} 
Mif@ | a <cand f(c) defined} = f(a). 


IA 


IA 


The last inequality is a consequence of the fact that if f(d) is defined in 
A then f2(d) is defined in A, and since A — RIg, fi(fo(d)) is defined in A 
and fi(f2(d)) < f(d). Hence, \{fi(fo(c)) | @< ¢ and fi(f2(c)) defined} < 
fi(fo(d@)) < f@). 


Corollary 1 The following are equivalent: 


(1) SLU Mon(%) U RI, - Vz /\ si(Z) < si(%) > s(Z) < s'(z); 
i=1 


(2) SLUMon(X)U RIg UGE L, where G = \\i_, si(@)<si(@)As(@)£s' (0); 
(3) (SLUMon(S)URIZ)21@lUuG EL where Vp7(G) =U; i>0 0 Wigy, with 
WRy=st(G), and WR ={ fod) | f(d) € Vay, (frofe)(@) Sf (a) € RIo}. 


Here st(G) is the set of all (ground) subterms occurring in G. Note that 
Wp1(G) can have at most |st(G)|-|NpR| elements. Thus, its size is polynomial 
in the size of G. On the other hand, the number of clauses in (S2ZUMon()U 
RI,)%21()1 is polynomial in |Vgz(G)|, and satisfiability of any set of ground 
clauses can be tested in polynomial time. This shows that the uniform word 
problem for the class SLOy(RJ) (and thus also for SLO} p(R1)) is decidable 


in polynomial time. 
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Example 1 We illustrate the ideas on an example presented in [4] (here 
slightly simplified). Consider the CBox C consisting of the following GCT: 


Endocard LE Tissue M dcont-in.HeartWall 1 dcont-in.HeartValve 
HeartWall C Spart-of .Heart 
HeartValve E Jpart-of .Heart 
Endocarditis E Inflammation M Shas-loc.Endocard 
Inflammation LC Disease 
Heartdisease = Disease [ Shas-loc. Heart 


and the following role inclusions RI: 
cont-in o cont-in LE cont-in 
part-of E cont-in 
has-loc o cont-in EC has-loc 


We want to check whether Endocarditis Ee Heartdisease. This is the case iff 
(with some abbreviations — e.g. foi stands for facont-in aNd fpo for fapart-of ; 
hy and hy for HeartWall resp. HeartValve, e for Endocard, h for Heart, etc.): 


SL U Mon(fei, fn, fpo) U {Va fei(fei(@))<fei(2), 
Var jolts fale), 
Va fri(fei(x))< fri(x) t 
U {e <ta fal hw) A Fal tial hiw < deal hy < Tpolh), 
Endocarditis <i A fr(e), i<d, Heartdisease = dA fri(h), 
Endocarditis < Heartdisease} F L. 
Then st(K, G) = { fei(hw); fe(hu), fpolh), Srie), fri(h)}- To compute VK(G), 
note that Wr = st(K, G), Wri = {fei(e), fei(h)}, and Ver = Ure 
Thus, Vx (G) = {fei(hw), fei(hv), fei(e), fei(h), fpo(h), rile), fri(2)F- 
After computing (RIg UMon( fei, fri, fpo) UCon) YO] and SLYO! we obtain 
the following conjunction of (Horn) ground clauses: 


G (RI, A Mon A Con) OIA SLIMY) 
Ext falhw) A fei(hy) fei(fei(z)) < fa(z) for x € UxK(G) 
hig = Fpa(h) foal) S fa) for x € UK(G) 
he < fpo(h) Fri(fei()) < fa(x) for x € UK(G) 
Endocarditis < iA fni(e) 

i<d tRy > fi(x)Rfaly) for x,y € Vc(G) 


G 
Heartdisease = dA fri(h) tRy > fpo(x)Rfpoly) for x,y € Vx(G) 
Endocarditis < Heartdisease | Ry > fri(x)Rfi(y) for xz,y € Ve(G) 


Re{S,2:4} 


yA) 
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By Corollary 1, Endocarditis Ce Heartdisease iff 6 = GA (RI, A Mon A 
Con)!¥()] A SLIY@) is unsatisfiable. Note that ¢ is a set of ground clauses 
in first-order logic with equality, containing all instances of the congruence 
axioms corresponding to the (ground) terms which occur in @. 

A translation to Datalog can easily be obtained by replacing the function 
symbols with binary predicate symbols. Alternatively, we can process the 
instances in @ by replacing, in a bottom-up fashion, all the terms starting 
with function symbols (which are all ground) with new constants (and adding, 
separately, the corresponding definitions) (cf. e.g. the remarks in [10, 7). 

The satisfiability of @ can therefore be checked automatically in poly- 
nomial time in the size of d which in its turn is polynomial in the size of 
Wx(G). Hence, in this case, the size of ¢ is polynomial in the size of G. 
Note however that if U(G) has 7 elements, the size of (RIg/\ Mon Con)! 
will be 168. The number of instances of clauses in SLY@!] is c-73 where 
c is a constant. Thus, although the number of instances which we need to 
consider is polynomial in the size of G, it can sometimes be large. 


Unsatisfiability can also be proved directly: G entails the inequalities: 


(1) Endocarditis < (dA fr(e)); (2) e< (failhw) A fei(hw)); 

(3) (hw < fpo(h)); (4) (hv S fpo(h)). 
Hence G A (RI, \ Mon A Con)! ae € = feilfpalt)) S foil fal) < Jal h). 
Thus, G \ (RIq \ Mon A Ue TE fale) < fri(feilh)) < fri(h), so 
GA (RIq \ Mon A Con)¥(@] E Endocarditis < dA fni(h), which together 


with dA fr(h) = Heartdisease and Endocarditis < Heartdisease leads to a 
contradiction. 


5.2 Locality and €L 


In [19] we proved that the algebraic counterpart of the description logic EL, 
namely the class of semilattices with monotone operators — axiomatized by 
SLU Mon(=) — has an even stronger locality property, namely for every set 
G of ground clauses 


SLUMon(X)AG EL if and only if (SLUMon(%))[G] AGEL 


where K[G] is the set of instances of K containing only ground terms occurring 
in G. In fact, we showed that the extension of the theory SL of semilattices 
with a family of monotone functions is local in the sense defined in [20]. 
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Theorem 10 ([25]) Let G be a set of ground clauses. The following are 
equivalent: 


(1) SLUMon(©) UG EL. 


(2) SLUMon(X)[G]UG has no partial model A such that its {A}-reduct is 
a (total) semilattice and the functions in © are partially defined, their 
domain of definition is finite and all terms in G are defined in A. 


Let Mon(X)|G]o U Go U Def be obtained from Mon(%)[G] UG by purification, 
i.e. by replacing, in a bottom-up manner, all subterms f(g) with f € X, with 
newly introduced constants cf(g) and adding the definitions f(g) = cz to the 
set Def. The following are equivalent (and equivalent to (1) and (2)): 


(3) Mon(%)[GJo U Go U Def has no partial model (A, A, {fa} pes) such that 
(A, A) is a semilattice and for all fed, fa is partially defined, its 
domain of definition is finite and all terms in Def are defined in A; 


(4) Mon(X)[G]o U Go is unsatisfiable in SL. 
(Note that in the presence of Mon(%) the instances Con|G]o of the 
congruence axioms for the functions in © are not necessary.) 


Con[G]o = {9=9' > epg) =cr(g’y | (9) =Crg), (9!) =crigy € Def}. 


This equivalence allows us to hierarchically reduce, in polynomial time, proof 
tasks in SLU Mon(%) to proof tasks in SL (cf. e.g. [25]) which can then be 
solved in polynomial time. ° 


Example 2 We illustrate the method on an example first considered in [2]. 
Consider the EL TBox T consisting of the following definitions: 


Aj P, Apo TI. T2.A3 
Ao = P» Az T2.o7r, Ay 
Az; = P3f Ag ary (Pi r] P2) 


® We could prove a similar theorem in the presence of role inclusion axioms for certain 
types of role inclusions. An extension to general role inclusions — which would provide 
more efficient instantiations, and therefore more efficient algorithms than those provided 
by Corollary 1 — is subject of work in progress. 
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We want to prove that P3M AgM4r;.(A, Az) Ez Ag. We translate this 
subsumption problem to the following satisfiability problem: 


SLU Mon(fi, f2) U {a1 = (pi Naz A fi(fe(as))), 
ag = (pz Aa3 A fo(fi(ar))), 
a3 = (p3 \ a2 A fi(pi A p2)), 
a(p3 A a2 A fi(ar A a2) < a3)} EL. 


We proceed as follows: We flatten and purify the set G of ground clauses by 
introducing new names for the terms starting with the function symbols fy, 
or fo. Let Def be the corresponding set of definitions. We then take into 
account only those instances of the monotonicity and congruence axioms for 
fi and fo which correspond to the instances in Def, and purify them as well, 
by replacing the terms themselves with the constants which denote them. We 
obtain the following separated set of formulae: 


Def GoA (Mon(f1, f2)[G])o 
fo(a3) =c1 | (1 =piANagAc) ake, >Re, RE {<,>} 
fics) =cC (a2 = po a3 A c4) a3Rc3 > «Rea, RE 1<s >} 
fi(ai) = 3 | (a3 =p3Aa2Ad1) aiRe1 > e3Rdi, RE {<,>} 
fo(cz) = C4 (p3 A az A dz L a3) a, Reg > c3Rd2, RE {x4 mat 
fier) = di | pi Apo = e1 eRe, > Rd, RE{<,>} 
fi(e2) = dy | a, \ag =e cj Rez > coRds, Re 1 >} 
e, Reg > d, Rd, Re 1S >} 


The subsumption is true iff Go A (Mon(fi, f2)[G])o is unsatisfiable in the 
theory of semilattices. We can see this as follows: note that a, \ ag < 
pi \ pa, te. €2 < e1. Then (using an instance of monotonicity) dz < d,, so 
p3 \ a2 \ dz < p3 \ a2 A dy = a3. 

This can also be checked automatically in PTIME either by using the fact 
that there exists a local presentation of SL or using the fact that SL = I'S P(S2) 
(i.e. every semilattice is isomorphic with a sublattice of a power of Sz), where 
S2 is the semilattice with two elements, hence SL and S2 satisfy the same 
Horn clauses. Since the theory of semilattices is convex, satisfiability of 
ground clauses w.r.t. SL can be reduced to SAT solving. 


6 Extensions of €£ and €L* 


The results described in Section 5 can easily be generalized to semilattices 
with n-ary monotone functions satisfying composition axioms. This allows 
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Table 2: Constructors for E£ with n-ary roles and their semantics 


Constructor | Syntax Semantics 
conjunction | Cy C2 Cine? 
existential AR(C1,...Cn) | {2 | Sy,---59n (2, 915---5Yn) € R® and y; € cry 


us to define natural generalizations of EL and EL*. We start. by presenting 
a generalization of E£ in which n-ary roles are allowed. We then sketch 
possible extensions in which role inclusions are also taken into account. 


6.1 Extensions of €L 
6.1.1 Extensions with n-ary Roles 


We consider extensions of €£ with n-ary roles. The semantics is defined in 
terms of interpretations Z = (D7, -7), where D7 is a non-empty set, concepts 
are interpreted as usual, and each n-ary role R € Np is interpreted as an 
n-ary relation R? C (D*)" (cf. Table 2). 


The possibility of considering n-ary roles can help in modeling certain notions 
more precisely than in situations in which we are restricted to binary roles. 
In medical ontologies, for instance, it is important to be able to express 
facts of the form “Disorder D has morphology M at site S”, modeled using a 
ternary relation morphology-at-site as morphology-at-site(D, M, S') [28]. 


6.1.2 Extensions with n-ary Roles and Numerical Domains 


A further extension is obtained by allowing for certain concrete sorts — having 
the same support in all interpretations; or additionally assuming that there 
exist specific concrete concepts which have a fixed semantics (or additional 
fixed properties) in all interpretations. The extensions we consider are 
different from the extensions with concrete domains and those with n-ary 
quantifiers studied in the description logic literature (cf. e.g. [5, 3]). 


Example 3 Consider a description logic having a usual (concept) sort and 
a ‘concrete’ sort num with fired domain N. We may be interested in general 
concrete concepts of sort num (interpreted as subsets of R) or in special 
concepts of sort num such as tn, {n, or [n,m] for m,n € R. For any 
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interpretation LT, tn? = {x €R|2> nh}, ln? ={2 €R| a < n}, and 
[n,m]? ={r ER|n<a2<m}. We will denote the arities of roles using a 
many-sorted framework. Let (D,R,-7) be an interpretation with two sorts 
concept and num. A role with arity (s1,...,5n) 1s interpreted as a subset of 
Ds, X +++ X Ds,, where Deoncept = D and Daum = R. 


1. Let price be a binary role of arity (concept, num), which associates with 
every element of sort concept its possible prices. The concept 


dprice.tn = {x | dk > n: price(x, k)} 


represents the class of all individuals with some price greater than n. 


2. Let has-weight-price be a role of arity (concept,num, num). The concept 


5 has-weight-price.(ty, |p) = {a | Sy’>y, dp'<p and has-weight-price(z, y’, p’)} 


denotes the family of individuals for which a weight above y and a price 
below p exist. 


The example above can be generalized by allowing a set of concrete sorts. 
We discuss the algebraic semantics of this type of extensions of EL. 


Let SLO, g denote the class of all structures 
(S,P(A1), aus »P(An), { far | re Np}), 


where S is a semilattice, A1,..., A, are concrete domains, and { f3, | r € Nr} 
are n-ary monotone operators. We may allow constants of concrete sort, 
interpreted as sets in P(A;). The classes DLOX,,,5 and BAOK,.s of all 
distributive lattices resp. Boolean algebras with concrete supports and n-ary 
join hemimorphisms { f3,|r € Nr} are defined similarly. 


Theorem 11 I[f the only concept constructors are intersection and existen- 
tial restriction, then for all concept descriptions D,, Dg, and every TBox T 
consisting of general concept inclusions GCI the following are equivalent: 


(1) Dy Er Do. 


(2) SLOF 


R 


REVOu ign \ C<D|>Di<Dy 
CEDEGCI 
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Proof: Analogous to the proof of Theorem 3. 


Let SLg be the class of all structures A = (A, P(A1),...,P(An)), with sig- 
nature IT = (S,{A}U%, Pred) with S={concept,s1,...,S,}, Pred={<}U{C; 
1<i<nb}, where A € SL, the support of sort concept of A is A, and for all 
i the support sort s; of A is P(A;). 


Theorem 12 ((25]) Every structure (A, P(A1),...,P(An), {fa} fen), where 
(i) (A, P(A1),..-,P(An)) € SLs, and 

(ii) for every fEX of arity s1...8n—78, fa is a partial function from 

[]j_, Us, to Us with a finite definition domain on which it is monotone 


((where Uconcept = A and U;, = P(A;)), 


weakly embeds into a total model of SLOy,5 (axiomatized by SLgUMon(x)). 


Corollary 2 Let G = Aj, si(@)<si(@ A s(@)£s'(@) be a set of ground unit 
clauses in the extension II° of II with new constants 4. The following are 
equivalent: 


(1) SLg UMon(X) UG EL. 


(2) SLg UMon()[G] UG has no partial model with a total {As }-reduct 
in which all terms in G are defined. 


Let Jj) Mon(X)[G]; U G; U Def be obtained from Mon(X)[G] UG by purifi- 
cation, i.e. by replacing, in a bottom-up manner, all subterms f(g) of sort s 
with f € X, with newly introduced constants cr(g) of sort s and adding the 
definitions f(g) = c¢ to the set Def. We thus separate Mon(X)|G] UG into a 
conjunction of constraints T; = Mon(X)[G]; U G;, where To is a constraint 
of sort semilattice and for 1 <i<n, TI; is a set of constraints over terms of 
sort i (i being the concrete sort with fixed support P(A;)). Then the following 
are equivalent (and are also equivalent to (1) and (2)): 


(3) Ujip(Mon(X)[G]; U G;) U Def has no partial model with a total {Agsz}- 
reduct in which all terms in Def are defined. 


(4) Ujio(Mon(%)[G]; U Gi) is unsatisfiable in the many-sorted disjoint 
combination of SL and the concrete theories of P(A;), 1<i<n. 
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The complexity of the uniform word problem of SLgUMon(=) depends on the 
complexity of the problem of testing the satisfiability — in the many-sorted 
disjoint combination of SL with the concrete theories of P(A;), 1<i<n— 
of sets of clauses Cooncept U Gian C;, U Mon, where Ceoncept and C; are unit 
clauses of sort concept resp. s;, and Mon consists of possibly mixed ground 
Horn clauses. 


Specific extensions of the logic E£ can be obtained by imposing ad- 
ditional restrictions on the interpretation of the “concrete”-type concepts 
within P(A;). For instance, we can require that numerical concepts are 
always interpreted as intervals, as in Example 3. 


A tractable fragment. In what follows we analyze the situation in which 
the concepts of sort num are described using the Ord-Horn fragment of 
Allen’s interval arithmetic [17] (fragment which has the property that all 
operations and relations between intervals can be represented by Ord-Horn 
clauses, i.e. clauses over atoms x < y, x = y, containing at most one positive 
literal (x < y or x = y) and arbitrarily many negative literals (of the form 


a #y)). 


Theorem 13 Consider the following extensions of EL with n-ary roles: 
(1) The one-sorted extension of EL with n-ary roles. 


(2) The extension of EL with two sorts, concept and num, where the 
semantics of classical concepts is the usual one, and the concepts of 
sort num are interpreted as elements in the Ord-Horn, convex fragment 
of Allen’s interval algebra [17], where any TBox can contain many- 
sorted GCI’s over concepts, as well as constraints over the numerical 
data expressible in the Ord-Horn fragment. 


In both cases, TBox subsumption is decidable in PTIME. 


Proof: The fact that CBox subsumption is decidable in PTIME in case (1) 
is an immediate consequence of results in [25]. 

We prove that this holds also for case (2). The assumption on the 
semantics of the extension of EL we made ensures that all algebraic models 
are two-sorted structures of the form 


A= ((A, A); Int(R, O), {fahrex), 


with sorts {concept, num}, such that (A,/) is a semilattice, Int(R,O) is 
an interval algebra in the Ord-Horn fragment of Allen’s interval arithmetic 
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[17], and for all f € ©, f4 is a monotone (many-sorted) function. We will 
denote the class of all these structures by SLordHorn. The fact that TBox 
subsumption is decidable in PTIME now follows from the following facts: 


e The theory of semilattices is convex; 


e Nebel and Biirckert [17] proved that a finite set of Ord-Horn clauses is 
satisfiable over the real numbers iff it is satisfiable over posets. As the 
theory of partial orders is convex, this means that although the theory 
of reals is not convex w.r.t. <, we can always assume that the theory 
of Ord-Horn clauses is convex. 


The main result in Corollary 2 can be adapted without problems to show 
that if G = Aji, si(@)<si(@ A s(@)£s'(@) is a set of ground unit clauses 
in the extension II° of II with new constants ©,., and if Mon()[G]. U 
Mon(%)[G]num U Ge U Gnum U Def are obtained from Mon(=)(G] U G by 
purification, the following are equivalent: 


© SLordHorn U Mon(%) UG EL; 


e Mon(X)[G]o U Go U Con|[Def]o is unsatisfiable in the combination of SL 
and the Ord-Horn fragment of Allen’s interval arithmetic. 


In order to test the unsatisfiability of the latter problem we proceed as 
follows. We first note that, due to the convexity of the theories involved and 
to the fact that all constraints in Go U Mon(%)[G]o U Con[Def]o are separated 
(in the sense that there are no mixed atoms) in order to test satisfiability 
of Go U Mon(X)[G]o U Con[Def]o we need to test entailment of the premises 
of Mon()[G]o U Con[{Def]o from Go; when all premises of some clause are 
provably true we delete the clause and add its conclusion to Go. The PTIME 
assumptions for concept subsumption and for the Ord-Horn fragment ensure 
that this process terminates in PTIME. 


Example 4 Consider the special case described in Example 3. Assume that 
the concepts of sort num used in any TBox are of the form tn, {lm and [n,m]. 
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Consider the TBoxz T consisting of the following GCIs: 


{ Aprice({n1) CC affordable, 
Sweight(tmi) car C_ truck, 
has-weight-price(fm,Jn) © Aprice({n) M Sweight(tm), 
{n = tn, 
time tma; 
CC car, 
C C 4G has-weight-price(tm,{n) } 


In order to prove that C Ey affordable truck we proceed as follows. We 
refute Apcprer D < D AC ¢ affordable A truck. We purify the problem 
introducing definitions for the terms starting with existential restrictions, 
and express the interval constraints using constraints over Q and obtain the 
following set of constraints: 


Def eae Ceoncept Mon 
Force Oa) = 41 m<ny | cy < affordable ny<noad<e 
foriceelin) =e m > my, | dy Acar < truck ny>nocj>ec 
fweight (tm1) = dy é<cnd m>m>d,<d 
fweight (tm) =i C < car mi <m- dy = d 
inet) =e C<e 

C & affordable A truck 


The task of proving C C7 affordable M truck can therefore be reduced to 
checking if Coum U Cooncept U Mon is satisfiable w.r.t. the combination of 
SL (sort concept) with LI(Q) (sort num). For this, we note that Chum 
entails the premises of the first, second, and fourth monotonicity rules. 
Thus, we can add c < cy and d < dy to Cooncept- Thus, we deduce that 
C <edAcar < (cA d) Acar < cy A (dy Acar) < affordable A truck, which 
contradicts the last clause in Ceoncept- 


6.2 Extensions of €L* 
Role inclusions for n-ary roles can be also quite useful in applications. 


Example 5 Consider an example similar to Example 3, in we additionally 
have a ternary role has-price-place with arity (concept, num, concept). The 
statement has-price-place(x,n,p) means that individual x has price n at 
location p. The concept 


has-price-place.(tn, C) = {x | dn’>n, Jy € C and has-price-place(x, n’, y)} 
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represents the set of all individuals for which a price below n in some location 
in C' exists. Let is-located be a binary role. Then 


has-price-place o is-located C_ has-price-place 


expresses the fact that if the database contains the information that object x 
has price n in place p, and if p is located in p' then object x has price n in 
place p’. (for instance if x has (some) price n in Spain and we know that 
Spain is located in Europe then x has (some) price n in Europe). 


Thus, for roles with arbitrary arity (possibly many-sorted) we can also 
consider role inclusion constraints of the form rj org C r. This means 
that, for every interpretation Z = (D, Aj,..., An), if (v1,...,@n) € rz and 
(n,-++;2n4~) € 7S then the tuple (71,...,¢n—1,2n41,---,) nek) € 7. The 
corresponding composition rule at algebraic level is: 


far, (U2,--+)Un—1, fare(Unti,-++)Unte)) = 
= {y, | dy € Ui,2 <i<n—-1,4y€U;,n+1<i<ntk, 

(Yrs Yn+1s-+-1Yntk) € 7g and (y1,¥2,---5¥n) ETT} = 
= {y, | dy:cU;, such that for2<i<n+k,iFn, 
Wis Heys + Ya—ay Yas «93 Yaak) ETE OTL} G 
C {y | dy; € U;, such that for2<i<n+kiFn, 

Visti tas ae asa eT eR 
Ee 271 (5 Re CF ea eee RTE 6 oe 


Theorem 14 The set of Horn clauses SEU Mon()U RIq, where the func- 
tions in % may be n-ary, has the property that every Evans partial model A 
such that: 


(i) for every fEX, fa is a partial function with a finite definition domain; 


(ii) for each axiom in RIq of the form: 


VE i ess 5 Dy ih 
CHAE cee iy Pa Pikes Bak) < Fiske Ci ae pas aes) 
and all ay,...,Qntk € A, if fa(@i,-.--,@n—1,An41,---,;AntK) 18 defined 


then fo4(GQn41;---,@n+k) ts defined in A; 
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(itt) AE SLU Mon(%)U RE; 


weakly embeds into a total model of SLU Mon(X) U R1g. 


Proof: Similar to the proof of Theorem 9. 


7 Conclusions 


In this paper we have shown that subsumption problems in E£ can be 
expressed as uniform word problems in classes of semilattices with monotone 
operators, and that subsumption problems in €£L* can be expressed as 
uniform word problems in classes of semilattices with monotone operators 
satisfying certain composition laws. This allowed us to obtain, in a uniform 
way, PTIME decision procedures for EL, €£L*, and extensions thereof. These 
locality considerations allow us to present a family of PTIME (many-sorted) 
logics which extend EL with n-ary roles and/or with numerical domains. 
Some of these extensions are similar and some are different from other types 
of extensions studied in the description logic literature such as extensions 
with n-ary existential quantifiers (cf. e.g. [3]) or with concrete domains [5]. 

In future work we will use the results presented in this paper and the 
results in [21, 22, 14] for studying interpolation properties in extensions of EL 
and for analyzing possibilities of efficient (modular) reasoning in combinations 
of ontologies based on extensions of EL. It would be interesting to also 
investigate which of the complexity (tractability/dichotomy) results for EL 
and its extensions (cf. e.g. [16, 15]) are consequences of the properties of 
their algebraic models. 
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